Estimating the error at given test input points for linear regression
نویسنده
چکیده
In model selection procedures in supervised learning, a model is usually chosen so that the expected test error over all possible test input points is minimized. On the other hand, when the test input points (without output values) are available in advance, it is more effetive to choose a model so that the test error only at the test input points at hand is minimized. In this paper, we follow this idea and derive an estimator of the test error at the given test input points for linear regression. Our estimator is proved to be an unbiased estimator of the test error at the given test input points under certain conditions. Through the simulations with artificial and standard benchmark data sets, we show that the proposed method is successfully applied in test error estimation and is compared favorably to the standard cross-validation and an empirical Bayesian method in ridge parameter selection.
منابع مشابه
A matrix method for estimating linear regression coefficients based on fuzzy numbers
In this paper, a new method for estimating the linear regression coefficients approximation is presented based on Z-numbers. In this model, observations are real numbers, regression coefficients and dependent variables (y) have values for Z-numbers. To estimate the coefficients of this model, we first convert the linear regression model based on Z-numbers into two fuzzy linear regression mode...
متن کاملپیشبینی قیمتهای نقدی گازطبیعی به کمک مدلهای غیرخطی ناپارامتریک
Developing models for accurate natural gas spot price forecasting is critical because these forecasts are useful in determining a range of regulatory decisions covering both supply and demand of natural gas or for market participants. A price forecasting modeler needs to use trial and error to build mathematical models (such as ANN) for different input combinations. This is very time consuming ...
متن کاملDifferenced-Based Double Shrinking in Partial Linear Models
Partial linear model is very flexible when the relation between the covariates and responses, either parametric and nonparametric. However, estimation of the regression coefficients is challenging since one must also estimate the nonparametric component simultaneously. As a remedy, the differencing approach, to eliminate the nonparametric component and estimate the regression coefficients, can ...
متن کاملکاربرد سیستمهای تکاملی در تعیین ضریب دبی سرریزهای کنگرهای مثلثی
A labyrinth weir is a nonlinear weir folded in the plan-view which increases the crest length and the flow rate for a given channel width and an upstream flow depth. Nowadays, a labyrinth weir is an attractive alternative for those weirs that have a problem in passing the probable maximum flood. The three-dimensional flow pattern and unlimited geometric parameters provide a major challenge to t...
متن کاملDetermination of the linear and non-linear relationships between soil erodibility factor and effective parameters on it in a mountainous watershed with severe soil erosion
Soil erodibility factor is a criterion of soil particle resistance to detachment, transport, and effects of erosivity factors (rain drop, runoff, and wind) during the soil loss processes. In this study, non-linear support vector machines (SVMs) method was used for investigating the effects of some topography, soil physical and mechanical properties on soil erodibility in a part of Northern Karo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004